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We report on exact results for the degree K, the diameter D, the clustering coefficient C, and 
the betweenness centrality B of a hierarchical network model with a replication factor M. Such 
quantities are calculated exactly with the help of recursion relations. Using the results, we show 
that (i) the degree distribution follows a power law Pk ~ K ' with 7=1 + In A//ln(M — 1), (ii) 
the diameter grows logarithmically as D ~ In N with the number of nodes N, (iii) the clustering 
coefficient of each node is inversely proportional to its degree, C oc 1/K, and the average clustering 
coefficient is nonzero in the infinite N limit, and (iv) the betweenness centrality distribution follows a 
power law Pb ~ B~ 2 . We discuss a classification scheme of scale-free networks into the universality 
class with the clustering property and the betweenness centrality distribution. 

PACS numbers: 89.75.-k, 89.75.Da, 05.10.-a 
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A network structure of complex systems has been at- 
tracting much research interest Q. Since the works in 
Refs. 0,0], it has been recognized that complex systems 
have neither a regular nor a random network structure. 
Instead, complex networks found in various areas have 
a scale-free (SF) structure characterized with a power- 
law distribution of the degree, see Ref. |Q and references 
therein. A possible mechanism for the emerging SF struc- 
ture is suggested by the Barabasi- Albert (BA) model [Q . 
However, the BA network lacks the clustering property 
which is observed in many real networks. To fill this 
gap, a hierarchical (H) network model was introduced as 
a model for SF networks with a clustering property [^J. 
It is observed that the H network displays a scaling law 
C ~ A"" 1 between the clustering coefficient C of a node 
and its degree K and that the average value of C does not 
vanish in the infinite size limit. The clustering property 
is also observed in some metabolic networks , which is 
regarded as an evidence for a hierarchical structure |5|, Q . 

On the other hand, the clustering property does not 
necessarily imply a hierarchical structure. Some SF net- 
work models ^, [j| [n], [llj have the clustering property, 
but it is not apparent whether they have a hierarchical 
structure. Therefore, it is desirable to study other scaling 
properties of the H network model to establish the uni- 
versality class for the hierarchical network. This is the 
purpose of the current work. We derive analytically exact 
scaling laws for the degree distribution, the diameter D, 
and the clustering coefficient (abbreviated to CC), and 
the betweenness centrality (abbreviated to BC) B ||. 
The CC is a measure of local connectivity or modular- 
ity, while the BC reflects a global property |l2|, [u| . The 
scaling property of both quantities characterizes the uni- 
versality class for the H network. 

The H network is characterized by G (the number of 
generations) and M (a replication factor). The network 
of the Gth generation, denoted as A/g, has N = M G 
nodes, which will be labelled by the coordinate G-tuple 
of integers [xq] = [xg ■ ■ ■ x\] with < xt < M. The net- 
work is defined recursively. The first generation consists 
of one central node [0] , which will be referred to as a hub, 



and (M — 1) peripheral nodes [y] with 1 < y < M. All 
nodes are fully connected with each other. Suppose one 
has A/g-i, each node of which is assigned to a coordi- 
nate [jeq_i]. In the next generation, (M — 1) copies are 
added to the network and all their peripheral nodes are 
connected to the hub of the original unit. Nodes in the 
original unit are assigned to [Oxg-i], and the copies to 
[yxo-i] with y = 1, . . . , M — 1, respectively. It leads to 
the network Ng- Nodes [yc ■■■yi] with y- L ^ for all i 
are the peripheral nodes and [Oq] is the hub of A/g- 

With the coordinate system, geometrical properties of 
the network can be studied combinatorially. In general, 
the node connectivity follows the rule: 



CI: [x G ■ ■ ■ x i+1 yi ■ ■ ■ yi] «-> [x G 
(1 < I < G and 1 < k < I) 



■ xi+iUi ■ ■ ■ Vk+iOk 



C2: [x G ■ ■ ■ x 2 yi] [x G ■ ■ ■ x 2 y[] for y Y ^ y[ 

Above and hereafter, we use x for a dummy variable rang- 
ing from to (M — 1) and y for one ranging from 1 to 
(Al — 1). The rule CI comes from the fact that all pe- 
ripheral nodes of A4 are connected to the hub Ofc during 
the replication, and C2 from the fact that all peripheral 
nodes of A/i are fully inter-connected. 

For a convenience, we classify nodes into four sets: (a) 
P for peripheral nodes [yc'-Vi], (b) LP/ (stands for 
Locally Peripheral) (1 < I < G) for nodes of a form 
[x G --- x l+2 y r --yi], (c) IM1 (Local Hub) (1 < I < G) 
for nodes of a form [xg ■ • ■ xi +2 yi+i Oj], and (d) H for the 
hub [Og], The size S of each set is given in Table |. 

Degree distribution — Using the connection rules, one 
can easily enumerate the degree, which is the number of 
neighbors, of each node. All nodes in the same set have 
the same degree, which are presented in Table |. The 
nodes in P and H contribute to the degree distribution 
Pjc at isolated points of K. For nodes in LP/, it is given 
by P K = M~ G Si\Al/AKi\ with Al = 1 and AK t = 
K l+X -Ki = l. Using K and S in Table Q, we find that 
Pjc ~ exp[Aln(l — 1/M)]. On the other hand, nodes in 
IMl has Ki ~ (M - 1)' and Si ~ M~ l . Hence, the degree 
distribution follows a power law Pk ~ A~ 7 for M > 2 
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TABLE I: The degree K, the clustering coefficient C, the partial betweenness centrality B°, and the betweenness centrality B 
of a node in each set and the number of nodes S in each set. 

set S K C B° B 



(M-D G G+(M-2) -2M«(M^)J( i m t ) G - 1 _ 1 



LPZ (M-I)'mW J + (M-2) (l^r)" 1 -! - 2M G { - l}, for I « G 

LHZ (M - l)M G - (i+1) EL=i( M - !)" E , '"^^^ M !_1 - 1 ~ 2M°(M 1 - 1 - 1), for J < G 

TUT 1 CM T\ n (M-2) n I M 2 (M-2) , M 2 1 » f 2( 

M 1 2^„=ll. M ~ E G =l(M -l)"-l U - \ M+T + M'i-1 111 M 2 -M+l J AJ 



with 7=1+ ^7g^Ti ' r ^^ ie n °d es m ^^-^ have larger 
degrees than those in LPZ. Therefore, the total degree 
distribution follows the power law with the exponent 7 
in the tail region. In particular, the hub has the largest 
degree, which scales as Khuh ~ iV 1 ^ 7-1 ). 

Shortest path and diameter — We first consider a 
shortest path from an arbitrary node [xg ■••£].] to the 
hub [Og]- One can reach the hub by flipping the co- 
ordinate using the rules CI and C2 successively. The 
shortest path has the minimum number of steps, which 
is called the distance. Note that a step using C2 always 
leads to a detour. Hence it must not be used in finding a 
shortest path to the hub. Following CI, one may flip the 
consecutive lowest (including x\) figures that are all zero 
to nonzero values, or vice versa. The whole consecutive 
lowest figures Xi- ■ ■ x\ with xi<j<i = (xi<j<i ^ 0) and 
^ {%4+i = 0) will be referred to as a zero (nonzero) 
domain. Then, one can reach the hub in minimal steps 
by flipping a zero/nonzero domain to a nonzero/zero do- 
main successively. The domain size increases at each flip 
until one reaches the hub, a zero domain of size G. Since 
a zero domain can be flipped to any nonzero domain, the 
shortest path has large degeneracy in general. 

The process resembles domain-coarsening in magnetic 
systems. To complete this analogy, we map the coordi- 
nate [xg • ' • x i] onto a spin state of the (Q = M)-state 
Potts model in one-dimensional lattice of size G and as- 
sign the energy with the Hamiltonian 



H = J2{l-S(6(x i ,Q),S{x i+u 0))}, 



(1) 



where i5(, ) is the Kronecker delta symbol and xg+i = 
is a fixed ghost spin. Then the distance between a node 
to the hub is given by the energy of the spin state. There- 
fore the mean node-to-hub distance is given by the av- 
erage energy of the spin system in the infinite temper- 
ature limit: D H (G) = - [din Z((3; G)/dp] /3=0 with the 



partition function Z(fi\ G) = e 



calculated using a transfer matrix method. After some 
algebra, we obtain that 



p g (o= 2(j f-v- ( m - i h m - 2 ) 



M 2 



M 2 



(2) 



A shortest path between arbitrary nodes [xg] = 
[xxg—i] and [x'g] = [x'x'g— 1] can be found recursively. 
If x 7^ x', all paths connecting them pass through the hub 
[Og]- So, the shortest path is given by a shortest path 
from one node to [Og] followed by a shortest path from 
[Og] to the other. If x = x', one can restrict the shortest 
path within a sub-network of all nodes [x"x"g-i] with 
x" = x, for a path utilizing other nodes with x" x 
does not reduce a path length 0. Note that the sub- 
network with all links to other nodes in Mg disabled has 
that same structure as Mg-i- Therefore, the distance 
satisfies the recursion relation 

d([x a ],[x' ]) = 6'(x,x') {d([x G ],[0G}) + d([x' G ],[0G})} 
+ S(x,x') d([x G -i],[*'G-i}) (3) 

where 5'(, ) = 1 — 5(, ) for short. Summing up over all 
node pairs, we obtain the recursion relation D(G) — 
2M-\M - l)D H (G) + M~ 1 D(G - 1) for the diame- 
ter (mean node-to-node distance) with the solution 



D(G) 



4(M - 1)G 2(M - 3) (M 2 - M - 4) 



M 2 



M 



(4) 



In the infinite N = M b limit, we find that D ~ 2D H 
and that the diameter scales logarithmically with N. It 
is a characteristics of the hierarchical network. For con- 
ventional (non-hierarchical) SF networks, the diameter 
scales sub- logarithmically for 7 < 3 jL5|, jig . 

Clustering coefficients — The CC of a node with K 
neighbors is given by C = 2N e /K(K — 1), where iV e is 
the number of existing edges between K neighbors. Using 
the connection rules, it is straightforward to calculate the 
CC of each node. Nodes in the same set have the same 
value of C, which are presented in Table Q. 

Using the results in Table |, the degree dependence of 
the CC is easily obtained. For nodes in P and LPi, we 
obtain that C{K) = (M -2)(2K - M+1)/K(K -1) with 
M - 1 < K < G + (M - 2). So, for large K > M, their 
CC's are inversely proportional to the degrees, C(K) ~ 
2(M - 2)1 K. The CC's of nodes in IMl and the hub are 
exactly given by C{K) = (M — 2)/{K — 1). The scaling 
law C ~ cK^ 1 holds for both cases, but the coefficient c 
differs by a factor of two (cf. Fig. 2(b) in Ref. ||). 
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The average CC is given by C = M~ G (S r C r + 
^(SlkClp; + Slhz Clh; ) + S M C M )- In the infinite size 
limit (G — > oo), it converges to a nonzero value 



C 



2 -A (2Z + M-3)(1-1/M)' 
f^i {l + M-2)(l + M-3) 



^ OO 



(M - 2f 



' M l ({M - 1Y+ 1 -2M + 3)' 



(5) 



Numerically C = 0.719282... and 0.741840... for M = 4 
and 5, respectively. C converges to 1 as M — > oo. 

Betweenness centrality - The BC of a node is the 
sum of weights of shortest paths between all node pairs 
that pass through the node. For a given node pair, all 
degenerate shortest paths connecting them are weighted 
with the inverse of the degeneracy. First of all, we calcu- 
late a so-called partial BC B°, which is obtained from a 
partial sum over all shortest paths between the hub and 
the others. It is calculated easily, since all shortest paths 
can be constructed using the domain-coarsening picture. 

Again, each node in the same set has the same value of 
B°. (a) A node u = [y G ■ • • J/i] in P may belong to a short- 
est path to the hub from nodes [yQ ■ ■ ■ yi +2 0x[ • • ■ Xj] with 
< I < G — 2 and arbitrary x[ . The domain-coarsening 
process leads them to [yc • • ■ IJ1+2O1+1] at an intermediate 
step. Then, the zero domain 0;+i flips to a nonzero do- 
main in the next step with probability [M — l)~( i+1 ' 
passing through the node u. Hence, each node con- 
tributes (M — 1)~(' +1 ) to B®. Summing up all contri- 
butions, we obtain the result in Table Q. (b) A node 
[xq ■ ■ ■ xi + 2^Ui ' ' ' Vi] hi LPZ may belong to a shortest 
path from nodes [x G • • ■ xi +2 ^yi • ■ • y m +2^x' m ■ ■ ■ x' ± ] with 
< m < I — 2 and arbitrary x\ to the hub. Fol- 
lowing the same idea as in (a), one can easily obtain 
see Table |. (c) A node u = [xg ■ • • xi +2 yi+\ Qi] 
in LHZ belongs to "all" shortest paths from nodes 
[xq ■ ■ ■ xi+2yi+i0x'i_i ■ ■ ■ x[] with arbitrary x[ except for 
u itself. So, B° u = M 1 - 1 - 1. (d) Trivially, B° = for 
the hub. 

By eliminating the parameter / in Table [j], we ob- 
tain that B°(K) = (M/(M ~ \))K-m+i for nodeg in 
set LPL It is diverging exponentially with K. On the 
other hand, Pk for nodes in UPl decays exponentially. 
So, Pgo decays algebraically. Explicitly it is obtained 
from P B o = M- G Si\Al/AB?\ : which yields that 



Pro 



M -1 

M 



{B° + l) 2 



(6) 



We obtain the relation B°(K) = -1 + M^l + K(M - 
2)(M - l)- 1 )''- 1 for nodes in the set LB. So the distri- 
bution is given by 



Pro 



M 2 (B° + l) 2 



(7) 



Therefore, the partial betweenness centrality has the 
power-law distribution with the exponent 2. 



The mean node-to-node distance was obtained us- 
ing the mean node-to-hub distance. We apply a sim- 
ilar idea for the BC. Introduce a character function 
Xg([ x'g], [x"g]> [ x g]) to denote the fraction of paths 
passing through [xq] among all shortest paths be- 
tween [ x'g] and [ x"g] in Mg- The [xg] depen- 
dence will be assumed implicitly. Then, the BC of 
a node [xq] = [xxa—i] can be written formally as 
B lx G ] = Y,x> g ,x g Xg([x g ],[x' g ]). The hub will be 
considered separately later, and we assume that [x G ] 
is not the hub for the time being. Decompose the 
sum Y^x'a ^x' G an( l similarly for x' G . 

When x' ^ x", all shortest paths between [x' x' G _ 1 ] 
and \x" x G _ 1 ] pass through the hub, which yields 
that X g(W x 1 ^}, \x" = X g([x' x'^], [0 g ]) + 

Xg([x" x g _ 1 ], [0 g ]). When x' = x" and [x G ] is not the 
hub, the summands are nonzero only when x' = x" = x. 
Using these properties, we obtain that 



B [Xg] = 2(M-1)M G - 1 B° 



+ 



E 



Xg([xx' c 



G-U) 



\xx f 



-l])' (8) 



B 



One might be tempted to identify the second term as 
[x G -\]i i- e -' tne BC defined on Ng—x- However this 
is not correct for x 7^ 0, since some pairs of [aJ^-J 
and [a^.jj may have degenerate shortest paths passing 
through the hub [0 G ] which is not present at Ng-i- This 
was explained when we discussed the diameter. For ex- 
ample, [yllll] and [z/1222] with nonzero y are connected 
via [ylOOO] and [yOOOO], and also via [00000], so each path 
has the weight 1/3. However, if one ignores the path 
via [00000], the other paths would have weight 1/2, and 
hence the nodes [ylOOO] and [j/0000] would have larger 
value of the BC. 

Therefore, the second term in Eq. (^) should be written 
as a sum of B^ Xg _ 1 ] and a quantity that compensate for 
the change in the degeneracy of the shortest paths. As 
the example showed, the compensation is necessary only 
for nodes of the form [x G ] — [y G ■ ■ ■ yi+iOi] with 1 < 
I < G. For such nodes, after enumerating all degeneracy 
carefully, Eq. (||) becomes 



B 



Xg] = 2(M-1)M „ [x 



tG-i b o 



B, 



X G -i] 

(M-2)(Af-l) fc M 2 ( ; - fc ) 
^ (G-l + k)(G-l + k-l) ' 



(9) 



The hub gains from such degenerate shortest paths, 
which lead to a similar recursion relation 



B iO G] 



(M - 1)(M 



2G-1 



2M 



G-l\ 



B 



[Oa-i] 



G-l 



+ (M-2)M^^fc#. 

k=2 

For other nodes, we have the simple relation 



(10) 



B [Xg] = 2(M - l)M G - 1 Bp !Bo] + B [Xa _ l} . (11) 
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With the recursion relations, we now readily calcu- 
late the BC of all nodes exactly. Since the exact ex- 
pressions are lengthy, we present only the leading order 
contribution in the large G limit in Table |. Note that 
B ~ 2M G B° for most nodes in LP/ and LH/ in the large 
G limit. The proportionality relation implies that the 
BC follows the same power-law distribution as B°: 



Pn 



B- 2 



(12) 



Summary and discussions — We have studied the ex- 
act scaling properties of the H network introduced in 
Rcf . |5| . We have shown that the H network has the clus- 
tering property: The average value of the CC is nonzero 
in the infinite network size limit and the CC exhibits 
the scaling law C ~ with (3=1. We have also 

shown that the BC follows the power-law distribution 
Pb ~ B~ v with rj = 2. Both scaling properties charac- 
terize the H network model. 

The BC proved to be useful in classifying SF networks 
into the universality class. The BC distribution expo- 
nent is universal and has the value either r\ ~ 2.2 in 
the class I or rj ~ 2.0 in the class II |l3|] . Combining 
the scaling properties of the CC and the BC, we suggest 
that there exist four classes, that is, I-C, I-NC, II-C, and 
II-NC ("C" for clustered and "NC" for non-clustered net- 
works) [[17]. The H network model then belongs to the 
class II-C. The Internet at the autonomous system level 
and some metabolic networks of archaea display both 
scaling behaviors with (3 ~ 1.0 || and rj ~ 2.0 Q. So 
they belong to the same class II-C, which is a stronger 
evidence for a hierarchical structure || . The B A network 
with m = 1 [| and the deterministic tree network pl| 
have C = and r\ = 2 jll| , thus they are members of the 
class II-NC. The BA network with m > 2 has vanishing 
CC [| and r) ~ 2.2 @, and belongs to the class 
I-NC. 



Literatures suggest that the metabolic networks of bac- 
teria and eukaryotes j?], [ll| and the coauthorship network 
in the field of neuroscience |l^, ||(| might belong to the 
class I-C. With the existence of the class I-C, the clus- 
tering property does not necessarily imply the hierarchi- 
cal structure. Therefore, it is important to establish the 
class I-C firmly. Further studies on the BC distribution 
in model networks with the clustering property, such as 
the Holme-Kim model Q and Klemm-Eguiluz model || , 
are required. Further studies are also necessary to reveal 
the similarity/dissimilarity between the metabolic net- 
works of archaea and those of bacteria and eukaryotes, 
which have different BC distributions. 

Goh et al. (ll| suggested that the topology of short- 
est pathways be a universal characteristics of SF net- 
works. They found a chain-like structure for networks in 
the class II {q = 2). It was presumed that the chain- 
like structure leads to a linear mass-distance relation 
m(d) ~ Ad where m(d) is the mean number of nodes 
along all shortest paths between a node pair separated 
by a distance d. We also found that the shortest path- 
ways of the H network have a chain-like structure. For 
example, a set of all shortest paths from [001010] to the 

hub in Af 6 is given by [001010] ^ [00101?/] A [0010 3 ] 

[OOlyyy] — > [Oq] with arbitrary nonzero y's. It has a 
chain-like structure, but the steps 1 and 3 introduce blobs 
whose size increases exponentially as one proceeds. We 
could show that m(d), averaged over all nodes separated 
by the distance d from the hub, satisfies an inequality 
m(d) > a(M — l) d with a positive constant a |21 . So a 



topological characterization other than the mass-distance 
relation is necessary to characterize the chain-like struc- 
ture observed in the class II-C. 
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